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SOLUTIONS  NEAR  SINGULAR  POINTS  TO  THE  EIKONAL 
AND  RELATED  FIRST  ORDER  NON-LINEAR  PARTIAL 
DIFFERENTIAL  EQUATIONS  IN  TWO  INDEPENDENT 

VARIABLES 

EMIL  CORNEA*,  RALPH  HOWARD^,  AND  PER-GUNNAR  MARTINSSON* 


Abstract.  A  detailed  study  of  solutions  to  the  first  order  partial  differ¬ 
ential  equation  H(x,y,  zx,  zy)  =  0,  with  special  emphasis  on  the  eikonal 
equation  z%  +  zy  =  h(x,y),  is  made  near  points  where  the  equation  be¬ 
comes  singular  in  the  sense  that  dH  =  0,  in  which  case  the  method  of 
characteristics  does  not  apply.  The  main  results  are  that  there  is  a  strong 
lack  of  uniqueness  of  solutions  near  such  points  and  that  solutions  can  be 
less  regular  than  both  the  function  H  and  the  initial  data  of  the  problem, 
but  that  this  loss  of  regularity  only  occurs  along  a  pair  of  curves  through  the 
singular  point.  The  main  tools  are  symplectic  geometry  and  the  Sternberg 
normal  form  for  Hamiltonian  vector  fields. 


1.  Introduction 

The  eikonal  equation  in  two  independent  variables  for  z  =  z(x,y)  is 
(1.1)  z2x  +  z2y  =  h(x,y) 

where  h  is  a  non-negative  smooth  function  on  the  plane,  and  subscripts  denote 
partial  derivatives.  Near  points  where  h  yf  0  all  local  solutions  to  this  equation 
can  be  constructed  by  the  method  of  characteristics  (c/.  [2,  Chap.  2],  [17, 
Chap.  10])  and  questions  of  local  existence,  uniqueness,  and  regularity  are 
fully  understood.  However,  near  points  where  h  vanishes  the  picture  is  much 
less  complete.  Our  main  goal  is  to  study  these  questions  in  detail  and  show 
that  near  a  point  where  h  has  zero  as  a  non-degenerate  local  minimum  value 
there  are  generally  infinitely  many  local  solutions.  These  results  are  extended 
to  solutions  of  certain  Hamilton- Jacobi  equations  H(x,y,  zx,  zy)  =  0  near  a 
rest  point  of  the  corresponding  characteristic  vector  field. 
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As  a  model  problem  we  will  use 

(1.2)  z\  +  z\  =  a2x2  +  b2y2 

where  a  and  b  are  positive.  For  ease  of  statement  we  discuss  our  general  results 
in  terms  of  this  special  case.  Normalizing  the  solution  so  that  z( 0,  0)  =  0,  the 
functions  z±t±  =  |( ±ax 2  ±fa/2)  clearly  solve  (1.2).  Moreover,  it  can  be  shown 
(see  Proposition  4.1  below)  that  any  C3  solution  to  (1.2)  near  (0,0)  is  of  the 
form  z  =  2±)±  +  O  ((|x|  +  M)3) . 

The  solution  z+t+  is  unique  in  the  sense  that  any  C2  solution  5  =  z+t+  + 
0((|x|  +  \y\ )3)  agrees  with  z+t+  in  a  neighborhood  of  (0,0)  (and  on  all  of  R2 
if  5  is  globally  defined),  with  a  similar  statement  holding  for  Z--.  (See  [4]  or 
Theorem  2.7  below  for  the  local  result  and  Oliensis  [12]  for  the  global  version). 
However,  the  two  saddle  point  solutions  z+)_  and  Z-t+  are  far  from  unique. 
We  demonstrate  (Theorem  3.2)  that  there  is  an  infinite  dimensional  family  of 
C°°  solutions  of  the  form  5  =  z+-  +  0((|x|  +  |r/|)3)  that  do  not  agree  with  z+)_ 
in  any  neighborhood  of  the  origin,  with  a  similar  statement  holding  for  . 
(This  gives  counterexamples  to  some  claims  in  the  literature,  cf.  [15,  p.  1094], 
which  would  imply  that  z±t±  are  the  only  solutions.)  We  also  show  that  for 
each  k  >  2  there  are  solutions  that  are  Ck .  but  not  Ck+1.  The  solutions,  z,  are 
obtained  by  looking  for  the  graph  of  the  differential  of  z,  that  is  the  manifold 
of  one  jets  of  2, 

N2  =  {( x,y,zx(x,y),zy(x,y ))  :  (x,y)  E  U}  C  R4, 

over  an  open  set  U  C  R2.  We  first  note  that  the  manifold,  N2,  for  z+)_  is 
obtained  by  letting  the  two  curves 

7± :  s  i-A  [s,  ±s,  as,  Tbs]1 

flow  under  the  characteristic  vector  field  of  (1.2)  and  taking  the  closure  of  the 
result.  Now  an  infinite  dimensional  family  of  saddle  point  solutions  can  be 
constructed  by  using  as  initial  data  for  the  flow,  not  the  curves  j±,  but  curves 
7±  obtained  by  perturbing  in  certain  directions.  If  are  Ck  and  agree  with 
7±  to  order  l  <  k  at  the  origin  then  the  solution  will  be  Ck+ 1  on  R2  \  {xy  =  0}, 
as  expected.  But,  surprisingly,  when  considered  as  a  function  on  all  of  R2  the 
regularity  is  only  Cn  with  n  :=  |~min{^^a,  —  1  (where  [•]  is  the  ceiling 

function)  and  so  there  is  a  drop  of  regularity  along  the  coordinate  axes. 

This  existence,  lack  of  uniqueness,  and  jump  in  regularity  along  a  pair  of  dis¬ 
tinguished  curves  is  not  specific  to  (1.2)  but  generalizes  to  the  eikonal  equation 
(1.1)  with 

(1.3)  h{x,y)  =  a2x2 +  b2y2 +  0{{\x\  +  \y\)3),  a,b>  0, 

under  the  extra  assumption  that  the  numbers  a  and  b  are  linearly  independent 
over  the  rational  numbers.  More  generally,  the  results  are  stated  and  proven 
in  Section  3.2  for  certain  Hamilton- Jacobi  equations  H(x,y,zx,zy)  =  0  near 
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a  rest  point  of  the  characteristic  vector  field  £#  of  H  (see  (2.3)  below).  In 
the  latter  case  it  is  assumed  that  the  eigenvalues  of  the  linearization  of  £# 
at  the  rest  point  are  a,  b,  —a,  and  —b,  with  a,  b  >  0.  The  proof  is  based  on 
using  the  Sternberg  normal  form  for  such  an  equation  (and  the  existence  of 
this  normal  form  is  only  guaranteed  when  a  and  b  are  linearly  independent 
over  the  rationals)  to  reduce  the  calculations  to  manageable  proportions. 

The  somewhat  unexpected  condition  that  a  and  b  be  linearly  independent 
over  the  rational  numbers  is  supported  by  a  formal  power  series  argument.  We 
show  that  equation  (1.1),  with  h  as  in  (1.3),  has  a  unique  formal  power  series 
solution  of  the  form 

(1.4)  z=  ^{ax2 -by2) +  0{{\x\  +  \y\)3) 

precisely  when  this  condition  holds  (see  Theorems  4.2  and  4.4  below).  More¬ 
over,  we  show  that  if  a  and  b  are  linearly  dependent  over  the  rationals  and  m 
and  n  are  integers  such  that  ma  —  nb  =  0  and  m  +  n  >  4,  then  the  equation 
z2  +  z2  =  ax2  +  by2  +  xmyn  has  no  Ck  solution  of  the  form  (1.4),  for  k  >  m  +  n. 
Thus  the  independence  condition  on  a  and  b  is  sufficent  and  very  close  to  be¬ 
ing  necessary.  The  formal  power  series  approach  has  previously  been  used  for 
analyzing  the  convex  and  concave  solutions,  see  Bruss  [4]  and  the  references 
therein. 

A  secondary  goal  of  this  paper  is  to  advocate  the  use  of  differential  geometric 
methods,  in  particular  differential  forms,  symplectic  geometry,  and  normal 
forms  such  as  the  Sternberg  normal  form,  in  working  with  first  differential 
order  equations.  To  this  end  we  have  included  some  expository  material  in 
Section  2  about  symplectic  geometry  and  its  application  to  the  method  of 
characteristics  for  first  order  differential  equations. 

The  article  can  be  summarized  as  follows. 

In  Section  2  we  present  the  basic  methods  of  symplectic  geometry  as  applied 
to  the  method  of  characteristics  for  first  order  differential  equations  of  the  form 
H(x,y,zx,zy)  =  0.  For  several  reasons  the  theory  is  easier  in  two  dimensions 
and  our  hope  is  that  this  will  be  useful  in  understanding  the  method  in  a 
concrete  setting.  This  section  also  gives  a  proof  of  a  slight  generalization  of  a 
Theorem  of  Bruss  [4,  p.  892]  on  the  existence  of  concave  and  convex  solutions 
to  the  eikonal  equation  (1.1). 

In  Section  3  we  first  discuss  in  detail  the  eikonal  model  equation  (1.2).  We 
prove  the  existence,  non-uniqueness  and  lack  of  regularity  results  mentioned 
above.  We  then  look  at  more  general  equations  of  the  form  H(x,  y,  zx,  zy )  =  0. 
The  same  type  of  results  hold,  but  we  were  not  able  to  give  quite  as  precise 
statements  for  the  regularity.  The  proofs  make  essential  use  of  both  symplectic 
geometry  and  the  Sternberg  normal  form. 

In  Section  4  we  discuss  the  solutions  to  the  eikonal  equation  (1.1)  in  a  formal 
power  series  framework.  At  this  level  we  prove  the  existence  and  uniqueness 
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of  saddle  point  solutions.  This  leads  to  examples  of  functions  h  for  which  the 
eikonal  equation  does  not  have  any  smooth  solution  of  saddle  type. 

The  article  ends  with  Appendix  A  which  includes  precise  statements  of 
two  of  the  geometric  tools  we  use:  the  stable  submanifold  theorem  and  the 
Sternberg  normal  form. 

Finally,  we  mention  that  the  eikonal  equation  and  certain  related  Hamilton- 
Jacobi  equations  have  been  of  interest  in  computer  vision  because  of  the  so- 
called  “shape  from  shading  problem.”  This  problem  corresponds,  roughly 
speaking,  to  the  reconstruction  of  a  shape  (a  surface)  z  =  z(x,y)  from  a  gray¬ 
scale  image  of  it.  That  is,  the  function  z  is  a  solution  of  the  Horn  image 
irradiance  equation  (see  [7],  Ch.  10),  which  under  some  assumptions  reduces 
to  the  eikonal  equation  [10].  Note  that  in  applications  the  data  function  h 
typically  has  quite  low  regularity.  In  this  case  viscosity  solutions  have  been 
considered  as  an  alternative  to  classical  solutions,  see  [10],  [14]  and  the  refer¬ 
ences  therein. 

2.  Application  of  symplectic  geometry  to  the  eikonal  equation 

2.1.  Review  of  the  method  of  characteristics  and  construction  of  con¬ 
cave  and  convex  solutions.  Let  R4  have  coordinates  x,y,p,q.  Then  the 
symplectic  form  on  R4  is 

u  :=  dp  A  dx  +  dq  A  dy. 

Let  N 2  C  R4  be  an  imbedded  surface.  Then  N 2  projects  on  an  open  set 
U  C  R2  iff  there  are  continuous  functions  p{x,y )  and  q(x,y )  so  that 

(2.1)  N2  =  {(x,y,p(x,y),q(x,y))  :  (x,y)  G  U}. 

The  submanifold  N 2  is  a  jet  of  a  function  iff  there  is  an  open  set  U  C  R2 
and  a  function  z  G  Cl(U)  such  that 

(2.2)  N2  =  {(x,y,zx(x,y),zy(x,y))  :  (x,y)  G  U}. 

It  is  clear  that  if  N 2  is  a  jet  of  a  function,  then  it  projects,  but  the  converse 
is  not  true.  The  following  is  a  standard  result,  but  we  include  the  short  proof 
for  those  not  familiar  with  the  differential  geometric  set  up. 

2.1.  Proposition.  Assume  that  N 2  C  R4  is  a  simply  connected  two  dimen¬ 
sional  submanifold  of  R4  of  smoothness  class  Ck  for  some  k  >  1.  Then 
N2  c  R4  is  the  jet  of  some  function  (which  will  be  unique  up  to  an  addi¬ 
tive  constant)  if  and  only  if  it  projects  over  some  open  set  U  C  R2  and  the 
restriction  of  the  symplectic  form  to  N 2  vanishes.  (That  is,  u(X,Y)  =  0  for 
all  vectors  tangent  to  N2 .)  Moreover,  if  N2  is  of  class  Ck  and  is  the  jet  of  a 
function  z,  then  z  is  of  class  Ck+1 . 
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Proof.  Assume  that  N2  projects  over  U  so  that  N 2  =  {(x,y,p(x,y),q(x,y))  : 
( x,y )  G  U}.  As  N2  is  simply  connected  the  same  is  true  of  U.  Using  x,y  as 
coordinates  on  N2,  we  see  that  the  restriction  of  the  symplectic  form  to  N2  is 

u  =  dp  A  dx  +  dq  A  dy  =  (px  dx  +  py  dy )  A  dx  +  (qx  dx  +  qy  dy )  A  dy 

=  (Qx  -  Py )  dx  A  dy. 

Therefore  the  restriction  of  u  to  N2  vanishes  iff  py  =  qx.  But  as  U  is  simply 
connected  this  is  exactly  the  condition  that  there  is  a  function  z  (unique  up 
to  an  additive  constant)  so  that  p  =  zx  and  q  =  zy.  If  N2  is  of  class  Ck  then 
p{x,y )  and  q{x,y)  are  Ck  functions  of  (x,y).  Thus  z  is  of  class  Ck+1.  □ 

A  two  dimensional  surface  N 2  C  R4  is  a  Lagrangian  surface  iff  the 
restriction  of  u  to  N 2  vanishes. 

Given  a  function  H :  R4  — »  R  the  characteristic  vector  field  of  H  (also 
called  the  symplectic  gradient )  is  the  unique  vector  yield  (h  on  R4  such 
that  for  all  vectors  X 

u(£H,X)  =  -dH(X). 

Then  a  simple  calculation  yields  that 

8H  d  dH  d  dH  d  dH  d 

H  dp  dx  +  dq  dy  dx  dp  dy  dq 

An  immediate  consequence  of  the  definition  of  £#  is  that 

dH(£H)  =  —co(^h,Ch)  =  0 

and  so  H  is  constant  on  the  integral  curves  of 

The  following  is  the  differential  geometric  justification  of  the  method  of 
characteristics  for  the  Hamilton- Jacobi  equation  H(x,y,  zx,  zy)  =  0.  Given  a 
smooth  function  H :  R4  — >  R  and  P  G  R4,  the  restriction  of  dH  to  T(R4)p  is 
denoted  dHp. 

2.2.  Proposition.  Let  N 2  be  a  simply  connected  two  dimensional  submani¬ 
fold  of  R4.  We  also  assume  that  the  following  non- degeneracy  condition  holds: 

(ND)  The  set  of  points  P  G  N 2  where  dHp  7^  0  is  dense  in  N 2  (here  dHp 
is  being  viewed  as  a  linear  functional  on  R4  and  not  just  restricted  to 
T(N2) ). 

Then  N 2  is  the  jet  of  a  solution  to  the  equation  H(x,  y,  zx,  zy)  =  0  if  and  only 
if  the  following  three  conditions  hold 

1.  N2  C  {P  :  H(P)  =  0}, 

2.  N 2  projects  over  some  open  set  1/CR2,  and 

3.  the  characteristic  vector  field  (h  is  tangent  to  N 2  at  every  point  of  N2 . 
If  N 2  satisfies  these  conditions  and  is  a  Ck  submanifold,  then  the  solution  z 
is  of  class  Ck+1 . 
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2.3.  Remark.  For  our  future  applications  it  is  important  to  realize  that  al¬ 

though  H  is  a  smooth  function,  the  zero  set  {H  =  0}  need  not  be  a  smooth 
submanifold  of  R4.  For  a,b  >  0  we  will  be  interested  in  H(x,y,p,  q)  = 
p2  +  q2  —  a2x 2  —  b2y2.  The  zero  set  { H  =  0}  is  then  a  cone  that  is  singular  at 
(0,0,  0,0).  However,  if  N2  is  a  smooth  surface  in  {H  =  0}  that  is  everywhere 
tangent  to  £#  and  which  projects  onto  an  open  set  in  R2,  then  N2  is  the  jet 
of  a  solution  to  z2  +  z2  =  a2x 2  +  b2y2.  This  is  because  dH  only  vanishes  at  one 
point  and  thus  the  set  of  points  on  N2  where  dH  0  is  dense.  □ 

It  is  useful  to  give  a  name  to  submanifolds  that  satisfy  two  of  the  conditions 
of  the  proposition: 

2.4.  Definition.  Let  H:  R4  — >■  R  be  a  Ck  function  with  k  >  2  and  charac¬ 
teristic  vector  field  £#.  Then  a  connected  two  dimensional  submanifold  N 2  of 
class  C 1  is  an  invariant  Lagrangian  surface  iff 

f.  N2  is  a  Lagrangian  surface  (i.e.  the  restriction  of  u  to  N2  vanishes),  and 

2.  The  characteristic  vector  field  £#  is  tangent  to  N 2  at  all  points  of  N2.  □ 

Proof  of  Proposition  2.2.  First  assume  that  the  three  conditions  hold.  Then 
in  light  of  Proposition  2.1  it  is  enough  to  show  that  the  restriction  of  u  to  N2 
is  zero.  If  we  are  at  a  point  P  G  N2  where  dH  f  0,  then  from  the  definition 
of  fj-j  or  the  formula  (2.3)  it  follows  that  £#  7^  0.  Let  X  be  a  tangent  vector 
to  N 2  at  P  linearly  independent  from  £#  (P).  Then  {£#(P),  X}  is  a  basis  for 
the  tangent  space  T(N2)p.  Using  the  definition  of  £#  and  that  dH(X)  =  0 
(as  H\m  =  0) 

u(£H,X)  =  -dH{X)  =  0. 

Thus  u  restricted  to  N2  vanishes  at  P.  But  we  are  assuming  that  the  set  of 
points  P  where  dH  does  not  vanish  is  dense,  thus  by  continuity  the  restriction 
of  u)  to  N2  is  zero  on  all  of  N2. 

Conversely,  if  N 2  is  the  jet  of  a  solution,  then  clearly  N2  C  {P  :  H(P )  =  0} 
and  N 2  projects  onto  an  open  subset  of  R2  and  also  H |  2  =  0.  Let  P  G  N2. 
Then  cu(X,Y)  =  0  for  all  X,Y  G  T(N2)p.  But  a  calculation  shows  that  if 
Z  is  any  vector  tangent  to  R4  at  P  with  u>(Z,X)  =  0  for  all  X  G  T(N2)p, 
then  Z  G  T(N2)p.  But  for  X  G  T{N2)p  we  have  dH{X)  =  0  so  (as  above) 
uj(£h,X)  =  —dH{X)  =  0.  Thus  £#  G  T{N2)p.  This  completes  the  proof.  □ 

This  result  makes  the  geometry  of  the  method  of  characteristics  clearer 
than  the  classical  presentations.  Let  H  be  of  class  Ck  for  k  >  2.  Then  the 
formula  (2.3)  makes  it  clear  that  the  vector  field  £h  is  of  class  Ck~ 1 .  Let  <f>^ 
be  the  flow  of  That  is,  4>^(P)  =  P  and  c(t)  :=  $^(P)  is  an  integral  curve 
of  the  vector  field  Then  (see  [9,  Thm  I,  p.  80]  or  [I,  p.  230])  the  map 
(t,  P )  1 — y  4>^(P)  is  Ck~l .  Now  let  c:  (a,  b)  — »  R4  be  a  curve  of  class  Cl  ( l  >  I) 
and  let  ir:  R4  — >  R2  be  the  projection  n(x,y,p,q)  =  ( x,y ).  Assume 
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1.  H(c(s))  =  0, 

2.  c  projects  over  an  imbedded1  curve  of  R2  (that  is  s  i-A  7r(c(s))  is  an 
imbedded  curve  in  R2),  and 

3.  at  all  points  c(s)  the  vectors  7r*c'(s)  and  n t*Ch(c(s))  are  linearly  indepen¬ 
dent. 

Then  let  F(s,t)  :  =  $^(c(s)).  By  the  implicit  function  theorem  for  r  small 
enough  the  submanifold  N2  :  =  { F(s,t )  :  (s,t)  G  (a,  b)  x  (— r,  r)}  will  project- 
over  an  open  subset  U  of  R2.  Also,  as  II  is  constant  along  the  flow  of  £#  and 
H(c(s ))  =  0,  we  have  that  H(F(s,t))  =  H(^(c(s)))  =  0.  Thus  H\n2  =  0. 
By  construction  £#  is  tangent  to  N 2  and  so  N2  is  the  jet  of  a  solution  z  to 
H(x,y,zx,  zy)  =  0  by  Proposition  2.2.  The  regularity  of  N 2  is  and 

therefore  the  regularity  of  z  is  cmin^k,l+1\ 

2.2.  Solutions  near  critical  points  of  H.  We  now  look  at  the  more  inter¬ 
esting  case  of  finding  solutions  near  a  critical  point,  Po,  of  //.  Assume  that 
dHp0  =  0  and  by  adding  a  constant  to  II  we  can  assume  that  H(P0 )  =  0. 
Then  near  P0  the  set  {H  =  0}  need  not  be  a  submanifold  of  R4.  Assume 
that  Po  is  a  non-degenerate  critical  point  so  that  the  Hessian  of  II  at  Po  is 
non-singular.  To  make  the  notation  easier  we  assume  that  P0  =  (0,0,  0,0). 
Then  using  the  form  (2.3)  we  see  that  the  linearization  of  the  characteristic 
system  for  this  vector  field  at  the  origin  is 


x(t) 

HXp 

Hyp 

Hpp 

Hpq 

x(t) 

x(t) 

d 

y(t ) 

HXq 

Hyq 

Hpq 

Hqq 

y(t ) 

—  T 

y(t) 

dt 

ph) 

H-xx 

HXy 

HXp 

Hxq 

ph) 

-  ±J 

pit ) 

lit) 

HXy 

Hyy 

~Hyp 

Hyq 

y{t) 

Q(t) 

where  all  the  second  partial  derivative  are  evaluated  at  (0,  0,  0,  0)  and  this  equa¬ 
tion  defines  L.  Let  det(D2P)  be  the  determinant  of  the  Hessian  at  (0,0,  0,0) 
and  let 


c2  :=  2 Hxy  Hpq  -  2 Hxq  II,m  +  HmHqq  -  H2n  +  IIxxIImJ  -  Hi 


The  characteristic  polynomial  of  L  is  then,  using  that  det(L)  = 
(which  can  be  seen  by  noting  that  L  =  J  D2H,  where  J  =  _°^ 
det(L)  =  det(J)  det {D2H)  =  det (P2P)), 

det(AJ  -  L)  =  A4  +  c2A2  +  det(P2P). 


det(P2P), 
42  ,  so  that 


XA  curve  c  is  imbedded  iff  it  is  the  image  of  a  smooth  injective  map  7  :  /  — »  R2,  with  /  an 
interval  in  R,  so  that  the  velocity  vector  7'  never  vanishes  and  so  that  the  topology  of  y[/] 
as  a  subset  of  R2  is  the  same  as  the  topology  induced  by  7  (  i.e.  7  is  a  homeomorphism) . 
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Therefore  the  eigenvalues  are 
(2.5)  ±1 


-c2+  4-4  det{D2H) 


±1 


— c2  -  y/4  -4det{D2H) 


Assuming  that  there  is  no  eigenvalue  with  zero  real  part  we  see  that  there  are 
exactly  two  eigenvaules  with  positive  real  part  and  two  with  negative  real  part. 
Then  let  N+  be  the  local  stable  manifold  for  the  critical  point  at  (0,0,  0,0) 
and  N‘i  the  local  unstable  submanifold.  (Loosely  Nf  is  the  set  of  points 
P  G  R4  so  that  limt^±00  <f>^(P)  =  (0,0,  0,0).  See  Appendix  A.l.)  Because  of 
the  condition  on  the  eigenvalues  of  L  both  N+  and  Nf  are  two  dimensional. 
If  H  is  of  class  Ck  for  k  >  2  then  the  flow  <f>^  is  of  class  Ck~l .  Therefore 
Nf_  and  Nf  are  Ck~l  submanifolds  of  R4  (see  [16,  Thm  5.20,  p.  49]).  Then 
for  a  point  P  G  N'i  we  have,  by  the  invariance  of  H  under  the  flow,  that 
H(P )  =  lim^y-oo  H($f(P))  =  H( 0,0,  0,0)  =  0.  Thus  H\n2  =  0.  Also  it  is 

clear  that  the  characteristic  vector  field  fn  is  tangent  to  N±.  The  condition 
of  Proposition  2.2  that  does  not  hold  automatically  for  N±  is  that  of  being 
locally  projectable.  Summarizing: 


2.5.  Theorem.  Let  H :  R4  — *  R  be  a  Ck  function  with  k  >  2.  Assume  that 
H  has  a  non-degenerate  critical  point  at  (0,0, 0,0)  with  H( 0,0,  0,0)  =  0  and 
assume  that  no  eigenvalue  of  L  as  defined  above  has  zero  real  part.  Then  there 
are  exactly  two  eigenvalues  Ai,A2  of  L  that  have  positive  real  part.  Let  e\ 
and  e2  be  the  eigenvectors  corresponding  to  Ax  and  X2,  let  J\f  be  the  subspace 
spanned  by  e\  and  e2  and  let  ir:  R4  — *  R2  be  the  projection  n(x,y,p,q )  = 
( x,y ).  Assume  that  the  restriction  7r :  J\f  — »  R2  is  nonsingular.  Then  near 
(0,0,  0,0)  the  unstable  submanifold  Nf  of  £h  is  the  jet  of  a  Ck  solution  z  to 
H(x,y,zx,zy)  =  0.  An  analogous  statement  holds  for  the  stable  submanifold 
N2+  of  iH. 

Proof.  The  tangent  space  to  Nf  at  (0,  0,  0,  0)  is  Af  so  if  7r  | ^ :  J\f  — *  R2  is 
nonsingular  the  implicit  function  theorem  implies  that  near  (0,  0,  0, 0)  the  sub¬ 
manifold  Nf  will  project  onto  an  open  subset  of  R2.  As  noted  all  the  other 
hypothesis  of  Proposition  2.2  hold.  This  completes  the  proof.  □ 


As  an  example  of  this  assume  that  H(x,y,p,q)  =  f(p,q )  —  h(x,y )  where  / 
and  h  have  critical  points  at  (0,0)  and  the  Hessians  of  /  and  h  at  (0,0)  are 
both  positive  definite.  (This  is  a  case  that  comes  up  in  the  shape  from  shading 
problem.)  Then  the  linear  map  L  is 

0  A 

B  0  ’ 


L  = 


0 

0 


0 

0 


h 


fpp  fpq 
fpq  fqq 


xy 
xy  hyy 


0 

0 


0 

0 
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where  A  and  D  are  positive  definite  2x2  matrices.  As  A  is  positive  definite 
it  has  a  square  root  A*.  The  matrix  A^BA^  will  also  be  positive  definite  and 
will  consequently  have  positive  eigenvalues.  Let  a2  and  b 2  be  the  eigenvalues 
of  A^BA?,  where  a,  b  >  0,  and  e4,e2  the  corresponding  eigenvalues, 

A^BA^ei  =  a2ei,  A^  BA^  e2  =  b2e2, 

and  define 

Vl  :=  a-± 

a  A  2  ei 


A2e2 

L4-2e2 


v3  :  = 


A?e  1 
—aA 

By  direct  calculation,  we  find 


v4  :  = 


A2e2 
-bA 


1 

2e2 


Lvi  =  avi,  Lv2  =  &v2,  I/V3  =  —  av3,  Lv4  =  — fev4. 

Thus  the  eigenvalues  of  L  are  ±a  and  ±6.  The  eigenvectors  corresponding  to 
the  two  eigenvalues  with  positive  real  part  are  v4  and  v2.  For  future  reference 
we  record  some  elmentary  facts  about  these  eigenvalues  and  eigenvectors.  The 
proof  is  left  to  the  reader. 


2.6.  Lemma.  If  a  7^  b  then  there  are  exactly  six  two  dimensional  subspaces 

of  the  tangent  space  T{  R4)  that  are  invariant  under  L,  corresponding  to  the 
six  ways  of  choosing  two  of  the  eigenvectors  v4, . . . ,  v4.  As  cn(vi,  v3)  7^  0  and 
tn(v2,v4)  7^  0  the  subspaces  span(v4,  v3)  and  span(v2,  v4)  can  never  be  tangent 
to  jets  of  functions  and  therefore  can  be  disregarded  from  our  considerations. 
Each  of  the  remaining  four  pairs  {v!,v2},  {v3,v4},  {v!,v4},  and  {v2,v3} 
spans  a  two  dimensional  subspace  of  the  tangent  space  T(R4)0  on  which  u 
vanishes.  If  {v.;,  v^}  is  any  one  of  these  four  pairs  and  ir:  R4  — *  R2  is  the 
projection  ir(x,y,p,q )  =  (x,y)  then  7r*v.;  and  7r*Vj  are  linearly  independent. 
Therefore,  if  N2  is  any  two  dimensional  submanifold  tangent  to  span(vi,Vj) 
then,  by  the  implicit  function  theorem,  N 2  locally  projects  over  some  open 
neighborhood  U  of  (0,0)  in  R2.  Finally,  span(vi,v2)  is  the  tangent  space  at 
the  origin  to  the  unstable  submanifold  of  fn  and  span(v3,v4)  is  the  tangent 
space  at  the  origin  to  the  stable  submanifold  of  £#.  □ 

The  following  gives  a  generalization  of  a  theorem  of  Brass  [4],  who  considered 
the  special  case  of  u2  +  u2y  =  h(x,y).  The  proof  uses  the  same  geometric  idea 
as  Brass’s  proof. 

2.7.  Theorem  (A.  Brass  [4]).  Let  f(p,q )  and  h(x,y )  be  functions  of  class 
Ck ,  with  k  >  2,  and  assume  that  f  and  h  both  have  critical  points  with  positive 
definite  Hessians  at  (0,  0).  Then  in  the  class  of  functions  that  are  concave  near 
(0,0)  there  is  a  unique  (up  to  an  additive  constant)  solution  to  f(zx,zy )  = 
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h(x,  y ).  This  solution  is  Ck  and  has  as  its  jet  near  (0, 0)  the  stable  manifold  of 
the  characteristic  vector  field  £#  of  H(x,y,p,q)  =  f(p,q)  —  h(x,y).  (Likewise, 
there  is  a  unique  convex  solution,  it  is  Ck  and  has  as  jet  near  (0, 0)  the  unstable 
submanifold  of  (h-) 

Proof.  Follows  from  the  discussion  above.  □ 


3.  Construction  of  saddle  type  solutions  near  a  regular 

CRITICAL  POINT 

3.1.  Construction  of  all  solutions  to  zi  +  z2  =  a2x2  +  b2y2.  We  will  now 

x  y 

construct  all  “saddle  point”  solutions  of  the  equation 

zx  +  Zy  —  a  x  +  b  y  , 

where  a,b  >  0.  The  analysis  here  is  a  model  for  the  more  general  setup 
covered  in  Section  3.2.  This  special  case  is  also  of  interest  as  it  is  possible 
to  be  somewhat  more  precise  about  the  regularity  of  the  solutions.  What 
makes  this  equation  especially  easy  to  analyze  is  that  the  components  of  the 
characteristic  vector  field  are  linear  so  that  finding  the  flow  involves  no  more 
than  linear  algebra.  We  will  assume  that  we  have  a  solution  defined  near 
(0,0)  with  2(0,0)  =  0.  If  a  ^  b,  then  by  a  formal  power  series  argument 
as  in  the  proof  of  Proposition  4.1  below,  we  have  that  if  2  6  C3  then  near 
(0,  0)  the  first  few  terms  of  its  Taylor  series  are  2  =  ^(±ax2  ±  by2)  +  0((|x|  + 
|y|)3)  and  if  a  =  b  we  can  bring  2  to  this  form  by  a  rotation  of  the  axes. 
If  2  =  \{ax2  +  by2)  +  0((|rc|  +  |r/|)3),  then  Theorem  2.7  applies,  since  the 
function  is  convex.  Hence  2  =  |( ax 2  +  by2)  is  the  unique  solution.  Likewise, 
if  2  =  —  \{ax2  +  by2)  +  0((|x|  +  |y|)3),  then  2  =  —  \{ax2  +  by2). 

Next  we  look  for  a  solution  of  the  form 

*  =  \(ax2  -  by2)  +  0((|z|  +  \y\)3). 

We  will  use  the  Hamiltonian  H  =  |(p2  +  q2  —  a2x 2  —  b2y2).  The  characteristic 
vector  field  is  then  +  q ^  +  a2x ^  +  b2y The  integral  curves  of  this 

vector  field  satisfy  the  differential  equation,  cf.  (2.4), 
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The  eigenvectors  of  the  matrix  L  are 


which  satisfy 


Lw  i  =  avi,  Lw  2  =  —  &v2,  I/V3  =  —aw  3,  Lw  4  =  bv  4. 

The  jet  of  the  function  z  =  \ (ax2  —  by2)  is  the  span  of  Vi  and  v2. 

To  construct  an  invariant  Lagrangian  surface  through  (0,0,  0,0)  we  start 
with  two  curves  j±  :  R  — >  R4  and  generate  the  surface  by  moving  these  curves 
by  the  flow  of  the  characteristic  vector  field.  Because  of  the  nature  of  the 
singularity,  two  curves  rather  than  just  one  are  required.  Toward  this  end  let 
R  — *  R  be  two  functions  so  that 

1.  ip±  are  Ck  functions  for  some  k  >  1  and 

2.  (p±  vanish  to  order  l  at  the  origin  for  some  l  <  k  (where  l  need  not  be  an 
integer).  Specifically,  this  means  that  there  are  functions  Tp± :  R  — *  R  so 
that 

(3.2)  <p±(s)  =  p±(s)sl 


where  <^±(s)  are  Ck  functions  on  R  \  {0}  and  the  derivatives  jj(p±(s) 
are  bounded  on  [—1, 1]  \  {0}  for  0  <  3  <  I 

Define  two  curves  7+ ,  :  R  — *  R4  by 

a2 

7±(s)  =  sw±  ±  sv2  +  tp± (s) v3  t  —  (p±(s)w4. 

Then 

Hh±{s))  =  0 

for  all  s.  Let  etL  be  the  exponential  of  L  so  that  for  P  E  R4  the  integral  curve 
of  the  characteristic  vector  field  through  P  is  1 etLP.  Define  F±  :  R2  — 1  R4 
by 

(3.3)  F±(s,t)  :=  eti7±(s)  =  seaVi  ±  se~btw2  +  <P ±{s)e~atw3  T  j^V±(s)ebtv4. 

We  now  consider  F+(s,t )  for  s  >  0.  Do  the  change  of  variables  ( s,t )  i-»  (u,v) 
with  u,  v  >  0  given  by 


f  u  =  seat 
y  v  =  se~bt 
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In  these  coordinates  we  have  that  the  image  of  F+(s,t)  with  s  >  0  can  be 
parameterized  by 


G(u,  v )  =,uv1  +  vv2 

2 

~a  a  ,  b  a  Cl  b  -b  b  a 

+  Ua+bVa+b  p+(Ua+bVa+b)\^  —  —'ll  a+b  Va+b  ip+  ['ll  a+bV  a+b  J  V4 . 

Ir 

3.1.  Lemma.  Let  p  be  a  function  on  [0,  00)  of  class  Ck  that  vanishes  to  order 
l  <  k  at  t  =  0  and  let  1  >  a,  /3  >  0.  Then  the  function 


E(u,  v ) 


ua  1vl3p(uav13),  u,v  >  0 
0,  otherwise 


is  Cn  on  R2  for  all  n  <  min{(7  +  l)a  —  1,(1  +  1 )/?}.  Conversely,  if  E(u,v)  is 
of  class  Cn ,  then  p  vanishes  of  order  l  at  t  =  0  for  all  l  >  max{(n  +  1  )/ct  — 
l,n/(3  -  1}. 

Proof.  The  hypothesis  on  p  implies  that  p(t)  =  tlif(t )  where  if  has  continu¬ 
ous  derivatives  up  to  order  k  on  the  open  interval  (0,  00)  and  is  bounded 
on  the  closed  interval  [0,oo).  Thus  E  can  be  rewritten  as  E(u,v )  = 
u{i+i)a-iv{i+i)y^uoLvy^  for  u,v  >  q_  Tlte  result  now  follows  by  direct  cal¬ 
culation  of  the  derivatives.  □ 


This  directly  implies: 

3.2.  Theorem.  Let  ip± :  [0,  00)  — *  R  be  Ck  functions  that  vanish  to  order 
l  <  k  at  0.  Let  F±  be  defined  by  equation  (3.3)  and  let  N2(p+,p_)  be  the 
closure  of  the  union  of  the  images  of  F+  and  F_,  that  is, 

N2(p_ |_,  p_)  :=  image(F+)  U  image(F_). 

Set  n  :=  |~min{  —  1,  where  [•]  is  the  ceiling  function.  Then 

N2(p. |_,  p_)  is  a  Cn~l  submanifold  of  R4  that  is  the  jet  of  a  Cn  solution  z(x,  y ) 
to  the  equation  z2  +  z2  =  a2x 2  +  b2y2.  Conversely,  any  Cn  solution  of  this 
equation  is  of  this  form  for  unique  ip+  and  p_.  Regardless  of  the  value  of  l, 
the  function  z  will  be  of  class  Ck+1  on  R2  \  {xy  =  0}.  Thus  there  is  a  decrease 
in  regularity  of  solutions  from  Ck+1  to  Cn  along  the  coordinate  axes. 

Proof.  All  but  the  statements  about  the  decrease  in  regularity  along  the  co¬ 
ordinate  axes  follow  from  Lemma  3.1.  The  two  curves  where  the  regularity  of 
the  surface  N2(p+,  pJ)  drops  are  along  the  lines  u  i-a  G(u,  0)  (which  projects 
down  onto  the  x-axis)  and  v  H>  G(0,  v )  (which  projects  down  onto  the  y- axis). 
All  other  points  of  N2(p+,  p_)  are  Ck.  As  in  Proposition  2.2,  this  implies  that 
z  is  Ck+1  off  of  the  coordinate  axes.  □ 
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3.2.  Solutions  to  H(x,y,zx,zy)  =  0  near  a  critical  point  of  H.  We  now 

consider  an  equation  H(x,  y,  zx,zy)  =  0  near  a  critical  point  of  H .  We  assume 
that  H  G  C°°,  H( 0,0, 0,0)  =  0  and  dH  =  0  at  the  origin.  Let  L  be  the 
linearization  at  the  origin  of  the  characteristic  system  defined  by  (2.4)  and 
assume  that  the  eigenvalues  of  L  are  a,  b,  —a,  —b  where  a,  b  >  0.  (By  (2.5)  the 
eigenvalues  are  of  this  form  if  they  are  real  and  nonzero.)  Let  ei,e2,e3,e4  be 
the  eigenvectors  for  a,  b ,  —a,  —b  respectively.  We  make  the  two  assumptions 

(3.4)  a  and  b  are  linearly  independent  over  the  rational  numbers, 

and 

/o  r\  {  {71*64  ,  7T*  e2 { ,  {71*63,  71*64)',  {71*64 ,  7T*64),  {7T*62,  7T*63{ 

'  {  are  each  linearly  independent  sets. 

(Where,  as  usual,  n:  R4  — *  R2  is  ir(x,y,p,q)  =  ( x,y ).)  This  and  the  implicit 
function  theorem  imply  that,  if  N2  is  any  C1  surface  in  R4  that  is  tangent  to 
both  ei  and  e2  (or  one  of  the  other  pairs  {e3,e4},  {ei,e4}  or  {e2,e3}),  then 
locally  N 2  projects  over  an  open  neighborhood  of  the  origin  in  R2. 

The  number  theoretic  assumption  (3.4)  is  what  is  needed  to  invoke  the 
Sternberg  Normal  Form  (Theorem  A. 2  in  the  Appendix)  and  conclude  there 
are  local  coordinates  x,  y,  p.  q  centered  at  the  origin  of  R4  with  u  =  dp  A 
dx  +  dq  A  dy  and  a  function  f(u,v )  with  /( 0,0)  =  0,  /„(0,  0)  =  fv( 0,0)  =  1 
and  H  =  |/(p2  —  a2x2,q2  —  b2y2).  In  the  coordinates  x,y,p,q  the  eigen- 
directions  of  L  are  given  by  the  vectors  vi,v2,v3,v4  of  (3.9)  below.  So  in 
these  coordinates  we  can  take  e±  te  v1;  e±  W.  v4,  e3  =  v3  and  e4  =  v2.  We  then 
can  compute  that  u;(ei,e2)  =  o;(e3,e4)  =  o;(ei,e4)  =  o;(e2,e3)  =  0,  but  that 
aj(e4,  e3),  cj(e2,  e4)  ^  0.  This  leads  to  a  general  existence  result. 

3.3.  Theorem.  Let  the  origin  of  R4  be  a  non- degenerate  critical  point  of  H 
such  that  the  linearization  L  of  the  characteristic  vector  field  £#  at  the  origin 
satisfies  the  conditions  (3.f)  and  (3.5).  Then,  with  the  notation  above,  each  of 
the  pairs  {ei,e2},  {e3,e4},  {ei,e4}  and  {e2,e3}  is  tangent  to  the  jet  of  a  C°° 
solution  to  H(x,y,  zx,  zy)  =  0. 

Proof.  Consider  the  four  submanifolds  N2._  of  R4  defined  locally  near  the 
origin  in  the  coordinates  x,y,p,q  by 

Nl,±  '■=  {( x,y,±ax,±by )  :  (x,y)  G  U}, 

where  U  is  a  small  neighborhood  of  (0,  0)  in  R2.  On  N±t±  the  relations p  =  Lax 
and  q  =  Lby  hold.  Thus  on  ±  we  have  H  =  \f(fp2  —  a2x2,q2  —  b2y2)  = 
|/(0,0)  =  0  and  therefore  Ar|  _t  C  { H  =  0}.  A  direct  calculation  using  the 
form  of  the  characteristic  vector  field  (h  in  the  coordinates  x,y,p,q  (see  (3.7) 
below)  shows  that  (h  is  tangent  to  N±,±.  Thus  Ar2  ±  is  an  invariant  Lagrangian 
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surface.  We  also  have 

T(iVj)+) o  =  span(ei,  e2),  T(iV2  _)0  =  span(e3,  e4), 

T(iV+ _)0  =  span(ei,  e4),  T(iV2)+)0  =  span(e2,  e3). 

Therefore  the  assumption  (3.5)  implies  that  locally  near  (0,0,  0,0)  each  of 
the  surfaces  N±±  projects  onto  an  open  neighborhood  of  (0,0)  in  R2.  Con¬ 
sequently  Proposition  2.1  implies  that  iV2)±  is  the  jet  of  a  C°°  solution  to 
H(x,y,  zx,zy)  =  0.  □ 

If  N 2  C  R4  is  the  jet  of  a  solution  to  H(x,  y,  zx,  zy)  =  0  that  passes  through 
(0,0,  0,0),  then  N 2  is  an  invariant  Lagrangian  surface  in  R4.  The  invari¬ 
ance  of  N2  under  the  flow  of  the  characteristic  vector  field  (h  implies  that 
T(N2) o  is  invariant  under  the  linearization  L  of  This,  coupled  with  the 
fact  that  u  vanishes  on  T(iV2)0,  implies  that  T(N2)0  is  one  of  the  four  sub¬ 
spaces  span(ei,e2),  span(e3,e4),  span(ei,e4),  or  span(e2,e3)  of  T(R4)0. 

The  subspace  span(ei,e2)  is  tangent  to  iV2)+  at  the  origin.  This  is  the 

unstable  submanifold  of  (h  at  (0,  0,  0,  0)  and  is  unique  in  the  sense  that  if  N2  is 
a  C1  invariant  submanifold  for  (h  with  T(N2)0  =  span(ei,  e2)  then  N 2  =  iV2)+ 
in  a  neighborhood  of  the  origin.  To  see  this  note  that  the  restriction  £#  |  ~2  has  a 
source  at  the  origin  (as  the  eigenvalues  of  the  restriction  L\~2  are  a,  b  >  0)  and 
so  for  all  points  P  of  N2  sufficiently  near  the  origin  lim^^  $t(P)  =  (0,  0,  0,  0). 
But  this  is  exactly  the  condition  that  P  be  on  the  unstable  submanifold.  Thus 
N 2  =  iV2  +  as  claimed.  There  is  a  corresponding  uniqueness  statement  for  the 
stable  submanifold  N2  _ . 

i 

Call  a  solution  z  to  H(x,y,  zx,  zy)  =  0  with  ^ (0,0)  =  2^(0, 0)  =  0  a  solu¬ 
tion  with  unstable  jet  iff  its  jet  near  the  origin  is  the  unstable  submanifold 
N2  +  for  Likewise,  z  is  a  solution  with  stable  jet  iff  its  jet  near  the 
origin  is  the  stable  submanifold  N2_  for  £ H .  Our  discussion  leads  at  once  to 
a  uniqueness  result. 

3.4.  Theorem.  If  the  hypotheses  of  Theorem  3.3  hold  and  if  z  is  a  solution 
to  H(x,y,  zx,  zy)  =  0  with  z(0,0)  =  ^(0,0)  =  zy(0,0)  =  0  of  either  stable  or 
unstable  type,  then  z  is  unique  in  the  sense  that  if  z  is  any  C 2  solution  with 
5(0,0)  =  5a, (0,0)  =  5j,(0,  0)  =  0,  such  that  the  jet  of  z  is  tangent  to  the  jet 
of  z  at  (0,0,  0,0),  then  z  =  z  in  a  neighborhood  of  the  origin.  (The  tangency 
condition  of  the  jets  is  equivalent  to  zxx( 0,0)  =  ^(0,0),  zxy( 0,0)  =  ^(0,0) 
and  zyy(0,0)  =  zyy(0,0).J  □ 

Call  a  solution  to  H(x,y,  zx,  zy)  =  0  with  z(0,0)  =  2^(0, 0)  =  ^(0,0)  = 
0  whose  jet  at  the  origin  is  not  the  stable  or  unstable  submanifold  of  the 
characteristic  vector  field  (h  a  saddle  solution.  The  motivation  of  this 
terminology  is  that  for  the  eikonal  equation  z2  +  z2  =  a2x 2  +  b2y2  with  a,  b  >  0 
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the  solution  with  unstable  jet  is  the  convex  solution  z  =  |( ax 2  +  by2)  and 
the  solution  with  stable  jet  is  the  concave  solution  z  =  —  ^(ax2  +  by2).  The 
other  two  obvious  solutions  z  =  ±|(a:r2  —  by2)  (which  are  not  unique  by 
Theorem  3.2)  have  saddle  points  at  the  origin. 

We  now  wish  to  investigate  in  detail  the  lack  of  uniqueness  of  saddle  solu¬ 
tions.  To  do  this  it  is  more  convenient  to  work  in  the  coordinates  x,y,p,q.  To 
simplify  notation  we  drop  the  bars  and  just  write  x,y,p,q  but  keep  in  mind 
that  these  are  not  the  original  symplectic  coordinates  on  R4  and  that  once  we 
have  constructed  the  jets  of  solutions  we  have  to  translate  these  results  back 
to  statements  in  the  original  symplectic  coordinates.  With  this  in  mind,  we 
assume  that  H  has  form 

(3.6)  H  =\f{p2  ~a2x2,q2  -b2y2), 

where 

/ (0,  0)  =  0,  /u(0,0)  =  /„(0,0)  =  il, 

and 

u  =  dp  A  dx  +  dq  A  dy. 


Then  the  characteristic  vector  field  is 


(3.7) 


d  d  d  0 

6r  =  fuP-z-  +  fvq-z-  +  a2fux—  +  b2fvy—. 
ox  Oy  op  oq 


The  integral  curves  of  this  vector  field  satisfy  the  differential  equation 


X 

O 

O 

O 

X 

d 

y 

o 

o 

o 

y 

dt 

p 

fuCL2  0  0  0 

p 

q 

o  fvb 2  0  0 

q 

This  is  not  a  linear  system  as  the  functions  fu  and  fv  depend  on  x,  y,  p  and 
q.  However  we  will  show  that  because  of  its  special  structure  it  can  be  treated 
almost  as  if  it  were  linear.  Letting  L  be  the  matrix  of  this  system  we  find  that 
its  eigenvectors  are,  cf.  (3.1), 


(3.9) 


l 

0 

0 

1 

Vl  = 

a 

,  v2  = 

0 

0 

-b 

1 

0 

0 

1 

v3  = 

—a 

,  v4  = 

0 

0 

b 

and  that 


Tv,  =  /„av| ,  Lv2 


~fvbv2,  Lv3 


~fuav3,  Lv4  =  fvbv4. 


Thus  L  has  a  basis  of  eigenvectors  that  are  independent  of  the  variables  x,  y. 
p  and  q.  For  use  in  the  statement  and  proof  of  Theorem  3.9  we  define  the  two 
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curves 

(3.10)  a  x(t)  =  tv1,  a  2(t)  =  tv2, 

which  are  just  the  first  two  coordinate  axes  for  the  basis  vi,v2,  v3,  v4.  What 
is  special  about  these  curves  is  that  while  we  will  construct  many  surfaces  that 
are  jets  of  solutions  and  tangent  to  span(vi,  v2),  these  curves  will  lie  on  all  of 
these  surfaces. 

Let  w4,w2,w3,w4  be  coordinates  on  R4  such  that 

=  W1V1  +  W2V2  +  W3V3  +  W4V4. 


\  (x  +  P/a) ,  w2  =  ^  (y  -  q/b ) , 

\(.x~  P/a )  >  w4  =  ^(y  +  q/b ) . 

In  these  coordinates  the  differential  equations  for  the  characteristics  become 
w'i  (t)  =  fuawi(t),  w'2(t)  =  -fvbw2{t), 

w'3(t)  =  ~fuaw3(t),  w'4(t )  =  fvbw'A(t). 

Recall  that  /„(0,0)  =  fv( 0,0)  =  1.  Therefore  given  e  >  0  there  is  a  5  >  0  so 
that  if  B(8)  :=  {{w\,W2,Wz,w4)  :  'Y/t=iw‘i  —  ^2)  cl°sed  ball  of  radius  8 

at  the  origin,  then  \fu  —  l\,\fv  —  l\  <  e  <  1/2  in  B(8).  We  now  use  that  if  a  func¬ 
tion  w(t)  satisfies  a  differential  equation  w'(t)  =  c  h(t,  w(t))w(t)  on  an  interval 
[ — k,  k],  where  c  is  a  constant  and  \h(t,  w(t ))  —  1|  <  e,  then  on  [— k,  k],  w(t )  = 
w( 0)ec6'lbt  for  a  function  6{t)  that  satisfies  |1— 0\  <  e.  The  differential  equations 
for  the  characteristics  are  all  of  this  form,  so  if  (iOi(0),  iy2(0),  103(0),  104(0))  G 
B(8)  and  t  is  so  that  (iui(t),  iy2(r),  ws(t),  w4(t))  G  B(8)  for  r  between  0  and 
t,  then 

w\  (t)  =  wi(0)ea6lt,  w2{t)  =  w2{  0)e~bd2t, 

w3(t )  =  u>3(0)e_a6l3t,  w4(t)  =  w4(  0)ebe4t, 

where 

|1  —  0i\  <  e  <  -  for  i  =  1 ,  2,  3, 4. 

At 

Let  <p+,  :  R  — »  R  be  two  functions  so  that 

1.  tp±  are  Ck  functions  for  some  k  >  1  and 

2.  <p±  vanishe  to  order  l  at  the  origin  for  some  l  <  k. 


x 

y 

P 

_q_ 

Then 

{w\  = 

W3  = 
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3.5.  Lemma.  There  are  unique  functions  ip±  defined  in  a  neighborhood  of  0 
so  that  if  7±  are  the  curves 

7 ±(s)  :=  svx  ±  sv2  +  <p± (s) v3  +  ^±(s)v4, 

then 

H(  7±W)  =  0. 

Moreover, 

1.  V’zt  are  functions  and 

2.  vanish  to  the  same  order  l  <  k  at  the  origin  that  ip±  do.  As  above, 
this  means  there  are  ip± :  R  — *  R  so  that 

(3.12)  tMs)  =  'ip±(s)sl 

where  fi>±(s)  are  Ck  functions  on  R  \  {0}  and  the  derivatives 
are  bounded  on  [—1, 1]  \  {0}  for  0  <  j  <  l. 

(In  the  case  H  =  \{p2  +  q2  —  a2x2  —  b2y2)  we  have  ip±(s)  =  7 fa2 /b2)p±(s) .) 

Proof.  We  will  show  how  to  find  fi>+,  the  derivation  for  ip_  being  simi¬ 
lar.  In  terms  of  the  coordinates  wi,w2,w3,w4  we  are  looking  for  ^>+(s) 
so  that  if  Wi  =  s,  w2  =  s,  w3  =  p+(s),  and  w4  =  fi>+{s),  then  7+(s)  = 
W\Vi  +  w2v2  +  W3V3  +  W4V4  is  the  required  curve.  Writing  H  in  terms 
of  the  coordinates  x,y,p,q  (related  to  the  coordinates  wi,w2,w3,w4  by 
(3.11))  we  have  H  =  \f(p2  —  a2x2,q2  —  b2y2).  Then  we  want  ip+  so  that 
if(7+(s))  =  |/(— 4a2s<p+(s),  — 4b2sfi+(s))  =  0.  Recalling  the  assumptions 
on  f  (i.e.  /( 0,0)  =  0  and  fu( 0,0)  =  fv( 0,0)  =  1)  we  have  by  the  implicit 
function  theorem  that  there  is  a  smooth  function  v ( • )  defined  in  a  neigh¬ 
borhood  U  of  0  with  u(O)  =  0,  h'(0)  =  —1  so  that  f(u,v(u))  =  0.  Then 
near  (0,0)  we  have  {(u,v)  :  f(u,v )  =  0}  =  {(u,v(u))  :  u  E  U}.  There¬ 
fore  near  s  =  0  we  are  trying  to  solve  —4 b2sip+(s)  =  u(— 4 a2sp+(s))  for 
ip+(s).  As  v(u)  vanishes  at  the  origin  it  can  be  expressed  as  v(u )  =  uv(u), 
where  v  is  a  smooth  function  and  v'(0)  =  —1  implies  u(0)  =  —1.  Therefore 
ip+(s)  =  (a2 /b2)p+(s)v(  — 4a2sp+(s)).  Note  that  if  <p+(s)  vanishes  to  order  l 
at  s  =  0,  then  this  formula  makes  it  clear  that  the  same  is  true  for  fi>+(s). 
Finally,  if  H  =  \{p2  +  q2  —  a2x2  —  b2y2)  then  f(u,  v )  =  u  +  v  and  so  v(u)  =  —u, 
v(u)  =  —  1  which  implies  ip+(s)  =  —  (a2/b2)p+(s).  □ 

Assume  that  p±  is  defined  on  the  interval  {—s0,s0)  and  let  47  be  the  flow 
of  (h-  Then  define  F±:  (—so,sq)  x  R  — *  R4  by 

(3.13)  F±(s,t)  :=  $t(7±(s)). 

Writing  this  as 

F±(s,t)  =  wi(s,t)v!  +  w2(s,t)v2  +  w3(s,t)v  3  +  ic4(M)v4 
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from  the  discussion  of  differential  equations  for  the  characteristics  we  have 


{Wi (s,t)  =  seaeit,  w2 (s,t)  =  ±se  bd2t, 

w3 (s,t)  =  tp±(s)e  t v4(s,t)  =  ip±(s)ebe4‘, 

where  \9t  —  1|  <  e  <  1/2.  We  are  now  going  use  u  =  Wi(s,t)  and  v  =  w2(s,t) 
as  new  variables  and  express  w3(s,t )  and  w4(s,t)  in  terms  of  these  variables. 
As  w2(s,t)  =  ±se~W2t  this  calculation  breaks  up  corresponding  to  the  choice 
of  signs.  To  simplify  notation  we  will  do  the  calculation  in  the  first  quadrant 
of  the  un-plane,  which  corresponds  to  choosing  +  in  the  definition  of  w2,  using 
the  functions  <p+,  ip+,  and  restricting  to  s  >  0.  The  calculations  in  the  other 
quadrants  are  identical.  In  the  first  quadrant 


u  =  Wi(s,t)  =  sea9lt,  v  =  w2(s,t )  =  se  bd2t, 


with  s,  u,  and  v  positive.  Solving  for  s  and  t  we  get 


s  =  ub/(.a+b)va/(.a+b)eab(02-Oi)t ^  ^  _ 


In  ( - 


a6\  +  b9  2  V  v 


Thus 


where 


.  fu\  l/(“01+&02)  /ti\  (1/(“+fo))+/31 

* = y  =  y 


pi  = 


1  1  (1  —  9i)a  +  (1  —  92)b 

cl9\  T  b92  a  T  b  (u  -f  P)(oPi  T  b92 ) 
As  we  are  assuming  that  |1  —  P;|  <  £  <  1/2  this  leads  to 


|Pi|< 


2e 

a  +  b 


=:  Cxe. 


This  implies 


s  =  ,ua+bva+b(et)ab^2  dl'> 


=  u 


a+bya  +  b  ^(uV  1)“+6+Pl^ 


02-fl) 


where 


b  a  /  _  1  \  02 
=  ua+bva+b  (uv  J  , 

1 


P2  ■  = 


a  +  b 


+  Pi  ab(92  —  9 1). 


As  |  Pi  —  9 2 1  <  2e,  | pi  |  <  2e/(a  +  b)  <  l/(a  +  6), 

M  <  ^  ==  c*. 

a  +  b 


FIRST  ORDER  NON-LINEAR  PDE  IN  TWO  VARIABLES 


19 


w3(s,t)  =  i P±{s)e  a°3t 

=  <p±(u“+tv'^E  (tiu_1)P2)  (up-1) ^ a+b+Pl)^  ad3'1 
=  ip±  (u^v^  (■ uv~1)p 2)  (up-1)  ^  a+b+P3^ 


where 


(1-03  )a  n 

p3  ■—  - ;  7 - Pid03. 

a  +  b 


Using  the  estimates  |pi|  <  2 e/{a  +  b)  and  |1  —  03\  <  e  <  1/2  (so  that  1/2  < 
03  <  3/2)  this  gives 

ea  2e  3  4ae 

|p3|  <  —T  +  —Toa  =  —T  =■  C3e. 
a  +  b  (2  +  62  (2  +  6 

Using  the  form  of  <p+  given  by  equation  (3.2)  we  have 

—7n  (n^Tb  P2 1  C  -?  /  aTE  -?  i  aTb  Pi/ii-1!^2^  (  a+6+P3) 


w3  =  <p+  ('U“+i,U“+f>  (tin  1)P2)  (u “+6 v a+b  ( uv  1)P2J  {uv  2) 
(3.15)  =!p+(u“+iV“+i  (tiu_1)P2)ti  a+b  v  °+b  {uv~1)lp2+p3 . 


Likewise 


where 


Thus 


w4{s,t)  =  i/j +{s)eWit 

=  ip+  (ii“+5n3+E  (tiu_1)P2)  (m;_1++f>+p1)604 
=  '0+('U“+sU“T5  (tin_1)P2)  (tiu-1)(“+t+P4)  , 


(04-1)0 

pi  —  - — - h  p\b6i. 

a  +  b 


eb  2e  3  Abe 

P4  <  ——  +  — TN0  =  — T  =:  C4£. 
a  +  o  a  +  o  2  a  +  b 


Using  the  form  of  ip+  given  by  equation  (3.12)  we  have 

wa  —  ip+  ('U“+5U“T&  (tin_1)P2)  ^ti^+H^  (uy_1)P2^  (tin-1) (“+6+p4) 


(3.16)  =  ^+{ua+bva+b  {uv  1 ) p2 ) u  *+*  v  a+b  {uv  iyp2+p 4. 

3.6.  Definition.  Let  He  a  positive  integer,  m  a  positive  real  number,  and 
U  a  connected  neighborhood  of  the  origin  in  R2.  Then  we  denote  the  class  of 
functions  defined  on  U  that  are  Ck  off  of  the  coordinate  axes  and  which  vanish 
to  order  m  along  the  axes  by  (++  =  Caxe+U).  Explicitly  h  E  Cixes\u) 


- 1  \  P2  \  tiitll  ,  i , 
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iff  h  is  Ck  on  U  \  {uv  =  0}  and  for  any  compact  subset  V  of  U  there  is  a 
constant  C  =  C'y  so  that 

\h(u,v)\  <  C\u\m  and  \h(u,v)\  <  C\v\m 

hold  for  all  (u,  v)  G  V.  □ 

Using  this  definition  we  now  summarize  and  extend  the  calculations  above. 

3.7.  Proposition.  Let  H :  R4  — *  R  of  class  C°°  be  given  by  (3.6),  let 
(p±:  R  -A  R  be  Ck  ( with  1  <  k  <  oo)  functions  that  vanish  to  order  l  at  the 
origin  (as  in  (3.2)),  and  let  ip±:  R  — *  R  and  the  curves  j± :  R  — *  R4  be  given 
by  Lemma  3.5.  Let  w4 (s,t), . . .  ,w4 (s,t)  be  the  solutions  of  the  characteristic 
equations  given  by  (3.1 4)  and  define  u(s,t )  =  Wi(s,t),  v(s,t )  =  w2(s,t).  As 
above,  we  can  then  solve  for  s  andt  in  terms  of  u  andv.  Then  define  functions 
on  a  neighborhood  of  the  origin  in  R2  by 

g(u,v)  :=  w3(s,t),  h(u,v)  =  w4(s,t),  for  u,v  ^  0, 

and  extend  this  to  the  coordinate  axes  by 

g(u ,  0)  =  g( 0,  v)  =  h(u,  0)  =  h( 0,  v )  =  0. 

Let  m  be  any  real  number  with 

f  bl  —  a  al  —  b  \ 

m  <  mm  < - — , - —  >  . 

\  a  +  b  a  +  b  J 

Then  there  is  a  connected  neighborhood  U  of  the  origin  such  that  g,  h  G 
Cixes\u).  If  <p±{s)  =  0  in  a  small  neighborhood  of  s  =  0,  then  g  and  h 
vanish  in  a  neighborhood  of  the  coordinate  axes,  and  are  Ck.  The  surface 

N 2  =  N2(p+,  Lp_)  :=  {ttvx  +  vv2  +  g(u,v)v3  +  h(u,v)v4  :  (u,v)  G  U} 

is  an  invariant  Lagrangian  submanifold  o/R4  for  some  sufficiently  small  neigh¬ 
borhood  U  of  the  origin  in  R2 . 

Proof.  Most  of  this  follows  at  once  from  the  calculation  above.  The  flow  'ly 
of  the  characteristic  vector  field  is  C°°  and  the  curves  j±  are  Ck.  Therefore 
Wi(s,t )  is  a  Ck  function  of  (s,t)  for  s  /  0.  So  by  the  inverse  function  theorem 
s  and  t  are  Ck  functions  of  (u,v),  and  g  and  h  are  Ck  functions  off  of  the 
coordinates  axes  u  =  0  and  v  =  0.  Along  the  u  =  0  axis  the  formula  (3.15)  for 
w3  in  terms  of  u  and  v  implies  that 

\g(u,v)\  =  \w3 1  <  C'|u|"s+^"(uu_1)^P2+P3' 

on  any  compact  set.  We  have 

\lp2  +  P3 1  <  (IC2  +  C3)e. 
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Since  >  m  we  can,  by  choosing  5  small  enough,  make  e  so  small  that 
—  {IC2  +  C3)e  >  m.  But  then  (for  a  possibly  larger  value  of  C ) 

\g{u,v)\  =  \w3\  <  C\u\m. 

Similar  considerations,  using  (3.15)  and  (3.16),  show 

\h{u,v)\  <  C\u\m ,  \g{u,v)\  <  C\v\m ,  \h(u,v)\  <  C\v\m. 

While  this  only  shows  that  this  estimate  holds  in  a  small  neighborhood  of 
(0,  0),  we  can  extend  it  along  the  u  =  0  axis  as  follows.  From  the  construction 
the  subset  N2  :  =  {nvx  +  w2  +  g(u,v)v3  +  /i(u,u)v4  :  (u,v)  G  V}  is  invariant 
under  the  local  flow  of  Likewise  the  uu-plane,  realized  as  :=  {ttvx  + 
vw2  :  (u,v)  G  R2},  is  invariant  under  the  flow  of  <l?t,  as  is  the  line  defined  by 
u  =  0,  realized  as  R,  :=  {vv2  :  v  G  R}.  The  estimates  above  imply  that  in 
some  small  neighborhood  of  the  origin  N2  and  R'y„  have  contact  to  order  m 
along  R, .  But  the  flow  T,  is  smooth  and  therefore  preserves  the  order  of 
contact.  Thus  invariance  of  N2,  R'y.w  and  R,  under  shows  that  N2  and 
R^  have  order  m  contact  along  R,  at  all  points  that  can  be  moved  close  to 
the  origin  by  Similarly,  we  can  extend  the  estimates  along  the  v  =  0  axis. 

Finally,  from  the  construction  we  have  that  N 2  C  {II  =  0}  and  that  the 
characteristic  vector  field  £#  is  tangent  to  N2 .  Let  X  be  a  vector  tangent  to 
N 2,  then  dH(X)  =  0,  because  N2  C  {H  =  0}  and  therefore  H  is  constant  on 
N 2.  From  the  definition  of  £#  we  have  uj{^h,X)  =  — dH{X )  =0.  As  ^  is 
tangent  to  N 2  and  T{N2)  is  two  dimensional,  this  implies  that  u  =  0  at  points 
of  N2  where  £h  ^  0.  But  £#  has  an  isolated  zero  at  the  origin  and  therefore  by 
continuity  u\N2  =  0  in  some  neighborhood  of  the  origin.  This  shows  that  N2 
is  an  invariant  Lagrangian  surface  near  the  origin  and  completes  the  proof.  □ 

Recall  that  the  coordinates  x.  y,p ,  q  we  have  been  working  with  are  not  the 
standard  coordinates  on  R4,  but  rather  the  coordinates  x,y,p,q  that  reduced 
H  to  the  Sternberg  normal  form.  We  now  need  to  translate  Proposition  3.7 
back  into  a  result  about  solutions  to  the  original  equation  H(x,y,  zx,  zy)  =  0. 
However  as  the  notion  of  being  an  invariant  Lagrangian  surface  is  invariant 
under  a  symplectic  change  of  coordinates,  this  translation  does  not  involve 
much  calculation. 

Let  N2  be  the  invariant  Lagrangian  surface  from  the  conclusion  of  Propo¬ 
sition  3.7.  Then  from  the  construction  of  N2  the  two  curves  a,  and  a2  in  R4 
defined  by  (3.10)  lie  in  N2.  Thus  a)  (0)  =  vx  and  a'2(0)  =  v2  is  a  basis  for 
T{N2)0.  Then  (recalling  that  v4  =  e4  and  v2  =  e4  at  the  origin)  the  condi¬ 
tion  (3.5)  and  the  implicit  function  theorem  imply  that  by  restricting  the  size 
of  N 2  we  can  assume  that  it  projects  over  a  neighborhood  of  (0,0).  Again 
letting  7r:  R4  — *  R2  be  the  natural  projection,  we  then  have  that  the  two 
curves  ir  o  a4  and  ir  o  a2  cross  transversely  at  the  origin.  Let  a4(f)  =  7r(ai(t)) 
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and  a2 (t)  =  7r(a2(t)).  Then  these  curves  only  depend  on  the  curves  ai  and 
a2  and  thus  only  depend  on  the  Sternberg  normal  coordinates  x,y,p,q  and 
the  function  //.  but  are  independent  of  the  invariant  Lagrangian  surface  N2. 
As  ai  and  a2  are  C°°  and  cross  transversely  at  the  origin,  there  are  C°°  local 
coordinates  u,  v  centered  at  (0,0)  so  that  for  t  near  zero 

w(a2(f))  =  0,  h(a1(t))  =  0. 

3.8.  Definition.  Let  k  be  a  positive  integer,  m  a  positive  real  number,  and  U 

a  connected  neighborhood  of  the  origin  in  R2  contained  in  the  domain  of  the 
coordinates  u,v.  Then  denote  by  c£’™Jxes(U)  the  class  of  functions  defined 
exactly  as  in  Definition  3.6  but  with  the  coordinates  u  and  v  replacing  the 
coordinates  u  and  v.  □ 

With  this  definition  we  can  now  translate  Proposition  3.7  into  a  statement 
about  solutions  to  the  differential  equation  H(x,y,zx,zy)  =  0. 

3.9.  Theorem.  Use  the  notation  of  Proposition  3.7  and  assume  that  (3.f) 
and  (3.5)  hold.  Then  given  the  Ck  (with  1  <  k  <  oo)  functions  ip± :  R  — *  R 
there  is  a  solution  z  to  H(x,  y,  zx,  zy)  =  0  defined  in  a  neighborhood  U  of  the 
origin  that  has  N2((p+,  <p_)  as  its  jet  near  the  origin.  Moreover,  for  any  real 
number  m  with 

.  [bl  —  a  al  —  b  1 
m  <  mm  j  -  —  ,  ~  —  j  , 

we  have  z  e  (U) . 

Proof.  That  there  is  a  solution  z  with  N 2  =  N2(tp+,  <p_)  as  jet  follows  from 
Proposition  2.2.  As  N 2  =  {(x,y,zx,zy)  :  ( x,y )  G  U]  there  is  a  gain  in 
regularity  of  one  derivative  in  going  from  the  jet  N2  to  the  function  z.  As  N2 
has  regularity  ,  we  see  that  z  has  regularity  ^ .  This  completes 

the  proof.  □ 

3.10.  Remark.  Choosing  ip+  and  <p_  to  vanish  to  infinite  order  at  the  origin 

we  see  that  under  assumptions  (3.4)  and  (3.5)  any  saddle  type  solution  to 
H(x,  y,  zx,  zy)  =  0  will  have  an  infinite  dimensional  family  of  smooth  deforma¬ 
tions.  □ 

4.  Formal  power  series  and  non-existence  of  smooth  solutions 

In  this  section  we  will  use  power  series  methods  to  investigate  the  local 
existance  and  regularity  of  solutions  to  the  eikonal  equation, 

(4.1)  z2x  +  z2  =  h, 

near  a  point  where  h  =  0.  We  assume  that  h  is  smooth  and  that  it  has  a 
non-degenerate  Hessian  at  the  zero  we  are  considering.  We  make,  if  necessary, 
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an  affine  change  of  coordinates  of  the  type  a  translation  followed  by  a  rotation, 
to  bring  the  series  expanion  at  this  zero  into  the  form 

(4.2)  h{x,y)  =  a2x2  +  b2y2  +  ^  hm,nxrnyn . 

m+n>  3 

for  some  a,b  >  0. 

4.1.  Proposition.  If  z  is  a  C 3  solution  to  (4-1)  such  that  2(0,0)  =  0,  and  if 
a  jtz  b,  then 

(4.3)  z(x,  y )  =  ]^(±ax2  ±  by2)  +  0((|x|  +  \y\ )3). 

If  a  =  b  then  (4-3)  still  holds  after  making  a  change  of  coordinates 


(4.4) 


x  cos  £  sin  £  x 

y  —  sin  £  cos  £  y  ’ 


for  some  angle  of  rotation  £. 

Proof.  Suppose  that  2  has  a  series  expansion  z(x,y)  =  ax  +  (3y  +  |( Ax 2  + 
2 Bxy  +  Cy2)  +  0((|x|  +  \y\ )3).  Then  z2  +  z2  =  a2  +  (32  +  0(\x\  +  \y\).  But  h 
vanishes  at  (0,0)  which,  along  with  (4.1),  implies  that  a  =  f3  =  0.  Evaluating 
the  second  order  terms  of  z2  +  z2  and  equating  them  with  the  second  order 
terms  of  (4.2),  we  get  the  equations 

(4.5)  A2 +  B2  =  a2,  B{A  +  C)  =  0,  B2  +  C2  =  b2. 


Suppose  first  that  a  ^  b,  then  we  must  have  B  =  0  since  otherwise  the 
middle  equation  would  imply  that  A  =  —  C  which,  together  with  the  first 
and  the  last  equations,  contradict  the  assumption  that  a  ^  b.  It  then  follows 
immediately  that  A  =  ±a  and  C  =  ±6,  and  so  (4.3)  holds.  This  holds  also  if 
a  =  b  and  B  =  0,  i.e.  A  =  ±a  and  C  =  ±a  with  any  combination  of  signs. 

If  a  =  b  and  B/  0,  then  any  solution  of  (4.5)  is  of  the  form 

Aq  =  PaV  1  —  #2,  Bg  =  ad,  Cg  =  Tct-Vl  —  92, 

for  some  Be  [—1,  1],  6  ^  0.  For  a  given  9,  the  function  2  =  |  (Agx2  +  2  Bgxy  + 
Cgy2)  +  •  •  •  is  brought  to  the  form  (4.3)  by  the  change  of  variables  (4.4),  if  £ 
is  chosen  such  that  tan£  =  9/(1  +  Vl  —  92).  □ 


As  the  convex  and  concave  solutions  are  well  understood  (these  are  the  ones 
corresponding  to  leading  terms  +(ax2  +  by2)/ 2),  we  will  focus  on  the  saddle 
solutions.  Thus  assume  that  2  has  Taylor  expansion 

z(x,y)  =  ^( ax 2  -  by2)  +  zm)nxmyn. 

m+n>  3 


(4.6) 
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(Note  that,  if  a  =  b,  we  work  with  the  rotated  coordinates.)  Then 

zx  =  ax  +  ^2  mzm,nxm~1yn 

m+n>  3 

so  that 

z2x  =  a2x2  +  ^2  ‘^mazrri)nxrn yn  +  f  ^  mzm^xm~xyn 

m+n>  3  \m+n>  3 


In  (Em+„>3  mzm^xm  1yn )2  we  get  terms 

mzm^nxrn~1ynkz^ixk~lyl  =  7nkzmjnzkjixm+k~2yn+l  =  mkzm^zKixpyq , 

where  p  +  q  =  m  +  n  +  k  +  l  —  2  >  fc  +  i  +  1  as  m  +  n  >  3  and  likewise 
p  +  q  >  m  +  n  +  1  as  fc  + 1  >  3.  Thus  there  are  polynomials  in 

the  variables  Zjj  with  i  +  j  <  m  +  n  such  that 


Putting  this  all  together  (and  setting  Pm+n  =  0  when  m  +  n  =  3)  we  get 

z2x  =  aV  +  ^  (2.M4w+PW<“"lfelj)W. 

m+n>  3 


Likewise, 

zy  =  -bx+  ^2  nzm,nXmyn~1 , 
m+n>  4 

so  a  similar  calculation  gives 

4  =  bV+  J2  {-Znbzm,n  +  Q^<rn+n\zhj))xmyn. 

m+n>  3 


Plugging  these  equations  into  (4.1)  we  get  the  recursion 
(4.7)  2 (ma  -  nb)zm,n  =  hm,n  -  (Pj%<m+n](zitj)  +  Q 


[i+j<m+n\ 

m,n 


4.2.  Theorem.  Let  h  be  a  smooth  function  so  that  the  Taylor  expansion  of 
h(x,y )  near  (0,0)  is  of  the  form  (f.2),  where  a  and  b  are  positive  and  linearly 
independent  over  the  rational  numbers.  Then  at  the  level  of  formal  power 
series  there  is  a  unique  saddle  point  solution  z  to  (4-1)  with  leading  terms  of 
the  form  (4-6) 


Proof.  If  a  and  b  are  linearly  independent  over  the  rationals,  then  ma—mb  ^  0 
for  all  m  and  n.  Thus  the  equation  (4.7)  uniquely  determines  all  the  coefficients 
of  2;  as  required.  □ 
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4.3.  Corollary.  With  the  same  hypothesis  as  in  the  last  theorem,  there  is  a 
smooth  function  z  so  that  z2  +  z2  —  h  vanishes  to  infinite  order  at  the  origin 
(0,0). 

Proof.  This  follows  from  the  well  known  theorem  of  Borel  (c/.  [3]  and  [8,  Thm. 
1.2.6.  p.  16])  that  given  any  formal  power  series  there  is  a  smooth  ( i.e  C°° ) 
function  having  the  given  formal  power  series  as  its  Taylor  series.  Thus  if  z  is 
the  formal  power  series  solution  given  by  the  theorem,  then  let  5  be  a  smooth 
function  that  has  2)  as  its  Taylor  expansion.  □ 


4.4.  Theorem.  Let  a  and  b  be  positive  real  numbers  that  are  linearly  depen¬ 
dent  over  the  rationals.  Then  there  is  a  smooth  function  h  of  the  form  (4-2) 
such  that  there  is  no  solution  z  of  the  saddle  point  form  (4-6)  to  the  equa¬ 
tion  (4-1)-  In  particular,  if  m  and  n  are  positive  integers  with  ma  —  nb  =  0 
and  m  +  n  >  4  and  c  ^  0,  then  for  h  =  a2x 2  +  b2y2  +  cxmyn  the  equation  (4-1) 
will  not  have  any  saddle  point  solution  in  formal  power  series  (or  any  Ck 
saddle  point  solution  for  k  >  m  +  n.) 


Proof.  If  a  and  b  are  positive  and  dependent  over  the  rationals,  then  there  are 
positive  integers  m  and  n  with  m  +  n  >  4  so  that  ma  —  nb  =  0.  Therefore,  if 
there  is  a  solution  z  to  (4.1)  of  the  form  (4.6),  the  equation  (4.7)  implies 


But  then,  by  adding  cxmym  to  h,  c  ^  0,  we  get  a  new  h  of  the  form  (4.2)  so 
that  there  is  no  solution  to  (4.1)  in  formal  power  series.  For  a  specific  example, 
let  ho  =  a2x 2  +  b2y 2  and  z0  =  |( ax 2  —  by2).  Then  (z0)l  +  (zo)2  =  h0  and  the 
equation  z2  +  z2  =  ho  +  cxmyn  will  have  no  solution  in  formal  power  series,  if 
ma  —  nb  =  0  and  c  ^  0.  □ 


A.  Appendix:  The  stable  submanifold  theorem  and  the 
Sternberg  normal  form 

A.l.  The  stable  submanifold  theorem.  The  following  is  a  statement  of 
the  Stable  Submanifold  Theorem  adapted  to  our  applications.  If  k  >  1  and 
X  is  a  Ck  vector  field  on  a  smooth  n-dimensional  manifold  M,  we  denote 
by  'I*;  the  flow  of  X.  That  is,  is  the  locally  defined  one  parameter  group 
of  diffeomorphisms  of  M  that  satisfy  J^4>t(P)  =  X(P).  As  X  is  Ck,  the 
function  (P,t)  H>  <f*t(P)  is  also  Ck  ( cf .  [1,  p.  230]  or  [9,  Thm  5  p.  86]).  A 
point  P0  e  M  is  a  hyperbolic  critical  point  of  X  iff  X(P0)  =  0  and  the 
linearization  of  X  at  P0,  L :  T(M)Po  — *  T(M)Po,  has  no  eigenvalue  with  zero 
real  part.  This  implies  that  P0  is  an  isolated  critical  point  of  X.  Letting, 
as  usual,  etL  =  Y^k=o (FL)/&!,  the  stable  subspace  of  X  at  P0  is  the  linear 
subspace  of  T(M)Po  defined  by 

T+(M)Po  :=  {v  G  T(M)Po  :  lim  etLv  =  0}. 

£—>•00 
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The  unstable  subspace  is  likewise  defined  by 

T_(M)Po  :=  {v  G  T(M)Po  :  lim  etLv  =  0}. 

t—^—oo 

If  no  eigenvalue  of  L  has  zero  real  part,  then  basic  linear  algebra  implies  the 
direct  sum  decomposition 

(A.l)  T(M)Po  =  T-(M)Po  ©  T+(M)Po. 

When  all  the  eigenvalues  of  L  are  real  and  distinct  (which  is  the  case  for  most 
of  the  applications  in  this  paper)  then  T+(M)Po  is  the  span  of  the  eigenvectors 
corresponding  to  the  negative  eigenvalues  and  T_(M)p0  is  the  span  of  the 
eigenvectors  corresponding  to  the  positive  eigenvalues. 

A.l.  Theorem  (Stable  Submanifold  Theorem).  If  k  >  1  and  P0  is  a  hyper¬ 
bolic  critical  point  of  the  Ck  vector  field  X,  then  there  is  a  connected  open 
neighborhood  U  of  P0  such  that 

N+:={PeU  :  lim  $t(P)  =  P0} 

t — ^  ~ |~oo 

(the  stable  submanifold  of  X  at  P0)  and 

N_  :=  {P  6  U  :  lim  $t(P)  =  P0} 

t — y — OO 

(the  unstable  submanifold  of  X  at  P0)  are  both  embedded  Ck  submanifolds 
of  U  .  The  tangent  space  to  N±  at  P0  is  T±(M)Po  so  dim  A  =  dim  T±(M)Po 
and  therefore  (A.l)  implies  that  N+  and  iV_  intersect  transversely  and  that 
dim  N+  +  dim  A_  =  dim  M.  □ 

This  was  originally  proven  in  various  degrees  of  generality  by  Hadamard 
[5],  Liapunov  [11]  and  Perron  [13].  Modern  presentations  of  the  proof  can  be 
found  in  [6]  and  [16,  Chap.  6]. 

A. 2.  The  Sternberg  normal  form.  Let  H  be  a  smooth  function  on  R4  and 
(h  the  characteristic  vector  field  of  H  as  given  by  (2.3).  Let  P  be  a  critical 
point  of  II  (and  thus  a  rest  point  of  and  let  L  be  the  linearization  of 
at  P.  Then  the  matrix  of  L  is  given  by  equation  (2.4).  If  the  eigenvalues  of  L 
are  real  and  nonzero,  then  the  formulas  (2.5)  imply  that  they  are  of  the  from 
a,  b ,  —a,  —b,  for  some  a,  b  >  0.  With  this  in  mind,  we  can  give  Sternberg’s 
normal  form  for  a  Hamiltonian  near  a  critical  point. 

A. 2.  Theorem  (Sternberg  [19]).  Let  H  be  a  smooth  function  on  R4  and  P 
a  critical  point  of  H  with  H(P)  =  0.  Assume  that  the  eigenvalues  of  the 
linearization  L  of  £#  at  P  are  a,  b,  —a,  —b  where  a,  b  >  0  and  that 

(A. 2)  a  and  b  are  linearly  independent  over  the  rational  numbers. 

Then  there  are  local  coordinates  x,  y,  p,  q  centered  at  P  and  a  smooth  function 
f(u,v )  of  two  variables  so  that 
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1.  x,y,p,q  are  symplectic  coordinates.  That  is,  if  u  is  the  symplectic  form 
then 

u  =  dp  A  dx  +  dq  A  dy. 

2.  The  function  f  satisfies  f(u,  v )  =  u  +  v  +  0(u 2  +  u2)  or  more  formally: 

/(0,  0)  =  0,  /u(0,0)  =  /„(0,0)  =  1, 

3.  /n  these  coordinates  H  is  given  by 

H  =  ^  f{p 2  -  a2x2,q 2  -  b2y2). 

This  follows  from  Thm.  9  in  Sternberg  [19,  p.  603]  applied  to  the  transforma¬ 
tions  given  by  the  flow  of  a  Hamiltonian  vector  field,  using  the  considerations 
from  Section  7  in  Sternberg  [18,  p.  818  -  819]. 
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